Data Augmentation for Abstractive Query-Focused Multi-Document Summarization
نویسندگان
چکیده
The progress in Query-focused Multi-Document Summarization (QMDS) has been limited by the lack of sufficient largescale high-quality training datasets. We present two QMDS datasets, which we construct using data augmentation methods: (1) transferring commonly used single-document CNN/Daily Mail summarization dataset to create QMDSCNN dataset, and (2) mining search-query logs QMDSIR dataset. These datasets have complementary properties, i.e., real summaries but queries are simulated, while simulated summaries. To cover both these summary query aspects, build abstractive end-to-end neural network models on combined that yield new state-of-the-art transfer results DUC also introduce hierarchical encoders enable a more efficient encoding together with multiple documents. Empirical demonstrate our methods outperform baseline automatic metrics, as well human evaluations along attributes.
منابع مشابه
RelationListwise for Query-Focused Multi-Document Summarization
Most existing learning to rank based summarization methods only used content relevance of sentences with respect to queries to rank or estimate sentences, while neglecting sentence relationships. In our work, we propose a novel model, RelationListwise, by integrating relation information among all the estimated sentences into listMLE-Top K, a basic listwise learning to rank model, to improve th...
متن کاملA Query Focused Multi Document Automatic Summarization
The present paper describes the development of a query focused multi-document automatic summarization. A graph is constructed, where the nodes are sentences of the documents and edge scores reflect the correlation measure between the nodes. The system clusters similar texts having related topical features from the graph using edge scores. Next, query dependent weights for each sentence are adde...
متن کاملQuery Focused Abstractive Summarization: Incorporating Query Relevance, Multi-Document Coverage, and Summary Length Constraints into seq2seq Models
Query Focused Summarization (QFS) has been addressed mostly using extractive methods. Such methods, however, produce text which suffers from low coherence. We investigate how abstractive methods can be applied to QFS, to overcome such limitations. Recent developments in neural-attention based sequence-to-sequence models have led to state-of-the-art results on the task of abstractive generic sin...
متن کاملOntology and Query-Focused Multi-Document Summarization System
Due to the increasing growth of online information on the specific topic, Multiple Document Summarization (MDS) has become a non-trivial task. The MDS facilitates the user to understand the large volume of information in a short time by creating a concise and comprehensive summary. In addition, user’s query based MDS system provides a consistent summary, including the core of the information. T...
متن کاملExperiments in Cross Language Query Focused Multi-Document Summarization
The twin challenges of massive information overload via the web and ubiquitous computers present us with an unavoidable task: developing techniques to handle multilingual information robustly and efficiently, with as high quality performance as possible. Previous research activities on multilingual information access systems have studied cross-language information retrieval (CLIR), information ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: Proceedings of the ... AAAI Conference on Artificial Intelligence
سال: 2021
ISSN: ['2159-5399', '2374-3468']
DOI: https://doi.org/10.1609/aaai.v35i15.17611